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Abstract 

We analyse the structure of the distribution of eigenvalues of the stock market 
correlation matrix with increasing length of the time series representing the price 
changes. We use 100 highly-capitalized stocks from the American market and relate 
result to the corresponding ensemble of Wishart random matrices. It turns out 
that systematically more eigenvalues stay beyond the borders prescribed by this 
variant of the Random Matrix Theory (RMT). This may indicate that even the 
bulk of the spectrum of the stock market correlation matrix carries some sort of 
correlations that are masked by a measurement noise when the time series used 
to construct the matrix are short. We also study some other characteristics of the 
"noisy" eigensignals, like their return distributions, temporal correlations or their 
multifractal spectra and the results support the above conclusions. 
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1 Introduction 



The formalism of correlation matrix is widely used in contemporary finance 
both on theoretical and on practical level in order to estimate correlation 
structure of a financial market or in order to create portfolios characterized 
by given properties. This formalism is especially important for various risk 
management techniques like e.g. the Markowitz optimal portfolio theory [1] 
which helps one constructing portfolios offering minimal risk at a given re- 
turn or maximal return at a given risk. This theory uses eigenvalues of the 
correlation matrix as a measure of the portfolio's risk: no matter which eigen- 
value one uses, it is fully informative and a corresponding portfolio is therefore 
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valid. However, recently such understanding of the portfolio selection has been 
strongly challenged in a series of papers linking the correlation matrix formal- 
ism and the portfolio theories with Random Matrix Theory (RMT) and the 
ensemble of Wishart matrices (sample random correlation matrices) in par- 
ticular [2,3]. Accordingly, the current view is that only a few eigenvalues and 
eigenvectors of the correlation matrix are important in practice and carry any 
significant information about the market, while all others describe nothing 
more than pure noise. Thus, only those few portfolios which correspond to 
the non-random eigenstates of the correlation matrix can be in fact regarded 
as a potential investment target. This observation profoundly restricting the 
applicability of classical portfolio theories rises a question of how to recon- 
cile the two contradicting views on the RMT-like portfolios. This problem is 
at present one of central issues of econophysics and is intensively studied by 
many different groups. One of directions in which current analyses go is the 
so-called denoising of the correlation matrices which aims at removing the es- 
timation errors of the correlations due to finite size of empirical data [4,5,6,7]. 
Another possible approach is to look into the empirical data and to answer the 
question what is there really random and what is not. In order to do this, one 
can firstly investigate the properties of dynamics of different portfolios calcu- 
lated by using the correlation matrix framework, to compare characteristics 
of random and non-random ones, and, possibly, to identify those signatures 
of the dynamics which represent some non-random phenomena even in the 
case of the RMT-like eigenstates. In the present paper we would like to follow 
this idea and draw some conclusions based on data from the American stock 
market. 



2 Formalism 

In general, a portfolio P consists of a number of securities X s , s = 1, N as- 
sociated with weights w s characterizing the fraction of total amount of capital 
invested in a particular security. Return of such a portfolio after time At is the 
weighted sum of logarithmic price increments g s (At) = In p s (t + At) — lnp s (t) 
of individual securities X s : 



By fixing the time scale At and creating a time series of length T from consec- 
utive discrete-time portfolio's returns {Gp(j)}J =l it is also possible to investi- 
gate dynamics of the portfolio in time. Although every possible set of weights 
define certain portfolio, from an investor's perspective only those portfolios 
which are characterized by specific predefined properties can be of practical 
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Fig. 1. Empirical eigenvalue spectrum of the correlation matrix C (vertical lines), 
calculated for 100 highly capitalized American companies over the period 1998-1999; 
the eigenvalues of a random Wishart matrix with the same Q may lie only within 
the shaded vertical region (a). Eigenvalue spectrum after effective rank reduction of 
C, i.e. after subtracting the contribution of the most collective eigensignal Z\ (b) 
and the two most collective ones Z\ and Zi (c). 

interest. For example, a portfolio which is least risky among family of ones 
offering the given future return Gp. Risk R(P) is usually quantified in terms 
of variance of the time series of historical returns 



R(P) = a 2 {P)=v^{G P { 3 )}] 



(2) 



According to the classical Markowitz theory, this risk can be related to corre- 
lations (or covariances) between the time series of individual security returns 
9s(j),j = I; ■■■,T for the relevant group of securities. 

More specifically, let one consider a set of A" securities (e.g. stocks) each rep- 
resented by a time series of normalized returns g s (j),s = 1, N; j = 1, T. 
From these time series an N x T data matrix M can be created and then a 
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Fig. 2. Empirical eigenvalue spectrum of correlation matrix C' (see text) after sub- 
tracting the contribution of the two most collective components associated with Ai 
and A2 for four different values of Q; (a) is the same as Fig. 1(c). Shaded regions 
correspond to RMT predictions for given Q. 

correlation matrix C according to the formula 



C = (1/T)MM T . 



(3) 



Each element of C is obviously the Pearson correlation coefficient C m ^ n be- 
tween a pair of signals m and n. The correlation matrix can be diagonalized 
by solving the eigenvalue problem 

Cxi = A iX i, i = l,...,N. (4) 



From the point of view of investment theories, each eigenvector x, can be 
considered as a realization of an iV-security portfolio Pi with the weights equal 
to the eigenvector components x\ , k = 1, N. For a non-degenerate matrix 
C, Pi and Pj are independent for each pair of their indices, which allows one 
to choose such a portfolio, whose risk is independent of others. 
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Fig. 3. Standard deviation cr(Zi) for all eigensignals Z% (top panel) together with 
the corresponding eigenvalues \ (bottom panel). Almost perfect correspondence 
between both panels is visible; the difference in units in the vertical axes is due to 
eigenvector normalization performed by diagonalization procedure. 

From a dynamical perspective, each eigenvector (and thus portfolio) can be 
associated with the corresponding time series of the portfolio's returns by the 
expression analogous to Eq.(l) 



N 



z i(f) = J2 x i 9k(j), i = j = 1,.-,T. 

k=i 



(5) 



These time series we shall call the eigensignals (see also [8,9] for an alter- 
native realization). One of the most important properties of such eigensignals 
is that their risk can be easily related with the corresponding eigenvalues: 



R(P i ) = a 2 (Z t ) = xJCx l = \ t , 



(6) 



Thus, the eigenvalue size is a risk measure and, in consequence, the larger Aj, 
the larger variance of Zi and also the larger risk of the corresponding portfolio 



Pi- 



3 Results 



We analyzed high-frequency data from the American stock market [10] in the 
period 1 Dec 1997 — 31 Dec 1999. In order to calculate inter-stock correlations 
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Fig. 4. Histograms of eigenvector components x\ for different eigenvalues Aj. A 
Gaussian is fitted to the empirical histograms corresponding to the most random 
eigenvectors. 



we chose a set of stocks of N = 100 highly capitalized companies listed in 
NYSE or NASDAQ (capitalization > $10 10 in each case). These stocks are 
frequently traded (0.01-1 transactions/s) and hence the time scale of At = 5 
min allowed us to perform a statistically significant analysis; such a short time 
scale was also desired because of length of time series (approx. 40,000 data 
points). Typically, the portfolio analysis is performed on daily data; usually 
this time scale is recommended because of the fact that at this scale the 
correlations present at the market are well-developed. However, restricting the 
analysis to such long At seems not to be necessary in contemporary markets, 
in which both pieces of information arrive more frequently and the investors 
react to them quicker than in the past. This obviously leads to acceleration 
of the market time paced by consecutive transactions. Thus, as recent papers 
document [13,14], for highly capitalized stocks which are also among the most 
frequently traded ones, the correlations reach their saturation level at the 
time horizons much shorter than a day. In case of our data this time horizon 
corresponds to about 30 min, while for At = 5 min the correlations associated 
with Ai are clearly very pronounced and informative and Ai assumes approx. 
2/3 of its saturation level [14]. 
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3.1 Eigenvalue distribution 



The eigenvalue distribution offers a representative and the most frequently 
used measure to quantify the characteristics of matrices, especially in the 
context of relating them to RMT. Let us therefore start presentation of the 
results with the eigenvalue spectrum of the correlation matrix. Figure 1(a) 
shows all 100 eigenvalues distributed along the horizontal axis, denoted by 
vertical lines. The largest eigenvalue Ai ~ 18, repelled from the rest of the 
spectrum, describes the collective eigenstate which can be identified with the 
market. A2 — 2.4 absorbs some collectivity either, but its magnitude is by 
an order of magnitude smaller than Ai and can be related with some branch- 
specific factor (the same applies to a few next eigenvalues). Figure 1(a) displays 
that only a small fraction of Aj's falls within the RMT region defined by its 
bounds: A™f* = 1 + 1/Q ± 2/y / (5 (shaded vertical region in Figure), where 
Q = T/N ~ 406 [15]. However, due to the fact that TrC = 100, the existence 
of strong collective components can effectively supress the noisy part of the C 
eigenspectrum, shifting smaller eigenvalues towards zero. Therefore, in order 
to correct for this effects it is recommended to remove the market factor Z\ 
from the data [11]. This can be done by means of the least square fitting of 
this factor represented by Z\(j) to each of the original stock signals gk(j) : 

gk(j) = a k + (3 k z 1 (j) + e { l }\j), (7) 

where a iy Pi are parameters, and then we can construct a new correlation 
matrix from the residuals e^\j) (e.g. ref. [2,11]). Now significantly more 
eigenvalues fall within the shaded RMT region as Figure 1(b) documents. 
This can be done once again and the A2 component can also be removed 
leading to the eigenspectrum presented in Figure 1(c). In fact, now many 
more eigenvalues (7 = 49%) overlap with the RMT interval < A m i n , A max >, 
though, interestingly, this value is qualitatively different from results presented 
earlier in [2,11,8] where vast majority of the eigenvalues was inside the RMT 
bounds. 

In order to shed some light on this problem we note that in both cited works 
the parameter Q was much smaller than in our case: Q — 4.1 and 6.4 in 
ref. [11] and Q = 3.2 in [2] and therefore the RMT spectrum was respectively 
wider. By manipulating the Q value for our data (we divide the time series 
into windows of length T' with predefined Q' = T'/N, then we calculate C for 
each window and average its eigenspectrum over all the windows) we obtain 
the correlation matrix eigenspectrum which can be compared with the original 
one for the undivided time series (Q = 406). Since in each case the average Ai 
is strongly repelled (and its value is approximately the same as in Fig. 1(a)), 
we follow the earlier procedure and remove the collective components related 
to both Ai and A 2 for each window before averaging the resulting eigenspectra. 
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Fig. 5. Reciprocal of inverse participation ratio (Eq.(8)) for all 100 eigenvectors of 
the correlation matrix. Noise level for random Gaussian case is also presented. 



Figure 2 shows such decollectified average eigenspectra for four different values 
of Q. What is immediately evident, the wider the shaded RMT region, the 
more eigenvalues it overlaps with. For the smallest presented Q = 3 as much 
as 7 = 89% eigenvalues fall within the RMT realm which is compatible with 
7 = 94% from ref. [2]. Naturally, as we verified by an explicit calculation, 
this observation obtained for At = 5 min qualitatively remains unaltered if 
we pass to longer time scales e.g. At = 60 min, despite the fact that the 
corresponding time series shorten and the maximum available Q decreases. 
We conclude that for a typical realization of Q < 10 in practical applications 
(usually large N and relatively small T as it happens for daily data), only 
the largest eigenvalues are able to deviate from the RMT predictions, while 
the other eigenvalues possibly carrying some more subtle correlations may be 
forced to spuriously merge with the random bulk. This purely statistical effect 
suggests that the random part of the C eigenspectrum can in fact comprise 
non-random components which can be discerned from noise only if one uses 
data with larger Q. This is in favour of using data also with frequencies higher 
than the daily one as a tool for denoising C. (It is noteworthy that a parrallel 
effect of shifting the small non-random eigenvalues (Aj m A m i n ) into the RMT 
interval has been presented recently in ref. [12].) 
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Fig. 6. Time series of the eigensignals for Ai (top) and A25 (bottom). Note different 
scales in vertical axes of both panels. 

3.2 Eigenvector properties 

Figure 3 illustrating the strong relation between the risk and the eigenvalues 
(Eq.(6)), displays the eigensignal variance calculated for each Zi (upper panel) 
and all the eigenvalues of C (lower panel). Essentially no significant qualitative 
difference between these two quantities can be found, exactly as expected. 
Apart from the eigenvalue spectrum, RMT offers useful predictions regarding 
distribution of the eigenvector components for a completely random matrix, 
which assumes the form of the Porter-Thomas (Gaussian) distribution. In 
contrast, in the case of a collective non-random eigenvector, there exists some 
kind of vector localization or derealization. Figure 4 presents distributions of 
the components for typical eigenvectors of our matrix and for a few specific 
cases. The eigenvector corresponding to the largest eigenvalue Ai is completely 
delocalized because all its components are roughly the same and the associated 
distribution is centered at 0.1. This is standard situation and in evolution 
of the stock prices this eigenvector represents the market factor. Also non- 
random is the eigenvector for A2 with the components distribution still far 
from Gaussian. A trace of randomness however occurs already for A3 and is 
clear for a bulk of eigenvectors in the next panel of Figure 4. On the other 
hand, for the extremely small eigenvalues, localization can be perfectly seen. 

One of the key properties of the eigenvector (and thus also the eigenstate Zj) 
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is the effective number of its large components. A related measure is inverse 
participation ratio 



and its reciprocal ("participation ratio"). Figure 5 presents this latter 
quantity calculated for all the eigenvectors together with its value N/3 for ran- 

(k) 

dom case in which x\ are taken independently from normal distribution. For 
% — 1 almost all the companies contribute to the corresponding eigenvector Z\, 
which justifies treating this eigenvector as the market factor. The eigenvectors 
for a few smaller eigenvalues show also slightly higher number of participating 
companies than for the random case but, in contrast, the eigensignals associ- 
ated with the smallest 20 eigenvalues allow one to characterize them as the 
components related to only few stocks. The rest of the eigenvectors seem to 
be random, with small deviation from the predicted value of N/3 probably 
due to the existence of fat tails of the returns distributions. 

Figure 6 presents the time series of the eigensignal returns zi(j) calculated 
according to Eq.(5) for Ai and A 2 5- Curiously, if one compares both series 
visually, forgetting the difference in vertical axis, it could be quite difficult 
to point out which of the two corresponds to the most collective eigenstate. 
Both eigensignals are nonstationary with likely extreme fluctuations and both 
of them reveal also volatility clustering. Thus, one can infer that there are 
statistical properties which are invariant under change of the eigenstates with 
only minor differences between collective and noisy eigenstates. 

In agreement with Figure 3, c.d.f. of the eigensignal returns (Figure 7(a)) show 
that Z 1 is characterized by much broader distribution than other eigensignals, 
and that the same, but to a lesser extent, is true also for Z 2 . It is interesting 
that, except for Z\, all the eigensignals are associated with the distributions 
with the power law scaling in tails, almost exactly like for the original stock 
returns (see e.g. [13,16,17,18]). Only Z 1 presents different behaviour: a short 
range of power law scaling and significant deviation from this behaviour for 
\z±\ > 5o". This can be even more convincing if all the signals are normalized 
to unit variance (Figure 7(b)); the power law slope with a ~ 3 is typical for 
all the eigensignals (although for a considerably larger At exactly this kind of 
scaling may not be observed [13], the eigensignal c.d.f.s also preserve tails of 
the individual stock returns c.d.f.s for the same At). The non-typical shape of 
c.d.f. for Z 1 in Figure 7(b) can originate from the fact that this eigensignal is 
composed as an average of almost 100 individual stocks, while on average 1/3 
of this number of stocks contribute to other eigensignals; this is why Central 
Limit Theorem leaves its fingerprints presumably on Z\. However, this cannot 
be considered as a rule, because for significantly larger time scales (e.g. At = 
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Fig. 7. (a) Cumulative distribution functions for the eigensignal returns for all Z^ 
(i = 1,...,N). Distributions for Z\ (thick solid) and Z2 (dot-dashed) are distin- 
guished. Inverse cubic power law (a = 3.0) is also denoted by dashed line, (b) The 
c.d.f. for the eigensignal returns after normalization to unit variance and averaging 
over Zi,i = 2, 100 (dashed) together with c.d.f. for Z\ (solid). 
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Fig. 8. Volatility autocorrelation function C(\Zi\,r) for i = 1,...,100. For each 
eigensignal Zj, daily trend was removed by the standard detrending procedure 
(see [31,32]). 

60 min) or for some other groups of stocks for which the cross-correlations 
are more intensive such a peculiarity of the Zx's c.d.f. might not be observed. 
Noteworthy to mention is that the inter-stock correlations responsible for a 
transfer of scaling [17] from the stock returns to the eigensignal distributions 
in Figure 7 are mainly linear (thus detectable by the correlation matrix). 
However, the very fact that scaling exists at At = 5 min or at longer time 
scales is an effect of highly nonlinear temporal correlations present in the stock 
returns (the volatility autocorrelations etc.) which may also be detectable in 
the eigensignals (see the next subsection). 



3.3 Temporal correlations 

Now let us look at temporal correlation properties of the eigensignals. Figure 
8 shows the volatility autocorrelation function c(\Zi\;r) = c(\zi(j)\, \zi(j + t)\) 
for all the eigensignals Zj. The difference between Z\ and the rest of signals 
is pronounced and resembles the corresponding difference in the case of the 
eigensignals variance (Figure 3). In fact, memory in Z\ is about two orders 
of magnitude longer than for the other eigensignals (for m > 1: c(\Zi\; r) ~ 
c(|Zi|; 10 2 * r)). However, it cannot be said that Z\ (the market component) 
absorbs all the memory of the evolution of stock prices. The volatility auto- 
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Fig. 9. Cross-correlation function c(Zi,\Zi\;r) for time series of returns Zi and of 
volatility \Z{\ calculated for all eigensignals separately. The resulting functions for 
the eigensignals corresponding to three largest eigenvalues are distinguished by thick 
lines. 



correlation function for the other eigensignals decay very slowly in time as 
well. 



Another interesting sort of nonlinear correlations are cross-correlations be- 
tween returns and volatility c(Z iy \Zi\;r). It has been observed that past re- 
turns imply negatively correlated future moves in volatility, the so-called lever- 
age effect [19,20,21]. Figure 9 shows the cross-correlation function c(Zi, \Z^\r) 
for the returns and the volatility for all the eigensignals. The leverage effect 
can be detected easily only for i = 1, 2, 3; all other eigensignals do not reveal 
it. For i — 1 and i — 2 the correlations are qualitatively the same despite the 
earlier-presented differences between the eigensignals. For i = 3 negative cor- 
relations do not exist for the smallest values of r, while they are the strongest 
for t > 100. In order to carry out this calculation we took signals for At = 20 
min, because analogous signals at smaller time scales are too noisy to show the 
strong leverage effect. Interestingly, for the original signals g s corresponding 
to individual stocks s, the leverage effect is much weaker and even difficult to 
observe at all. 
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3.4 Multifractal characteristics 



One of the consequences of the broad probability distributions (Figure 7) and 
of the nonlinear correlations (Figure 8 and 9) is multifractal character of sig- 
nals, i.e. the existence of a continuous spectrum of scaling indices f(a). It has 
already been shown in numerous works that stock returns form signals which 
are multifractal both on daily and on high-frequency time scales [22,23,24,25,26,27,28,29,30,32]. 
This behaviour can be even modeled with a good agreement if one introduces 
statistical processes based on multiplicative cascades [21,33,34,35,36]. Due to 
the fact that, by definition, the eigensignals are calculated as a sum of stock 
returns, the multifractality of their components can be transferred to the re- 
sulting eigensignal. Therefore one can expect that at least some of Zj's are also 
multifractal; there can exist differences in their singularity spectra because of 
different stock compositions for different i's, though. Owing to superiority 
of Multifractal Detrended Fluctuation Analysis (MFDFA) [37] over Wavelet 
Transform Modulus Maxima (WTMM) method [38] in the case of financial 
data [39], we applied former one to our time series in order to calculate the 
singularity spectra f(a). 

We start from our eigensignal % represented by the time series Zi(j) of length 
N es and estimate the signal profile 



where < ... > denotes averaging over Zi(k). In the next step Y is divided into 
M es segments of length n (n < N es ) starting from both the beginning and 
the end of the time series so that eventually there are 2M es segments. In each 
segment v we fit a Z-th order polynomial to the data, thus removing a 
local trend. Then, after calculating the variance 
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and averaging it over z/'s, we get the qth order fluctuation function 




(11) 



for all values of n. The most important property of F q {n) is that for a signal 
of the fractal character it obeys a power-law functional dependence on n: 
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Fig. 10. (a) Singularity spectra f(a) for eigensignals corresponding to all correlation 
matrix eigenvalues Aj. Spectrum for the eigensignal Z\ associated with the largest 
eigenvalue is denoted by a thick solid line; (b) singularity spectrum for Z\ (solid) 
compared with the average spectrum for all other eigensignals Zi, i = 2, ...,100 
(dashed). 
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Fig. 11. Ah COTT vs. A/i ranc i for eigensignals corresponding to 30 largest eigenvalues. 
Symbols denoting Z\, Z 4 are labeled. 

at least for some range of n. As a result of complete MF-DFA procedure we 
obtain a family of generalized Hurst exponents h(q), which form a decreasing 
function of q for a multifractal signal or are independent of q for a monofractal 
one. A more convenient way to present the fractal character of data graphically 
is to calculate the singularity spectrum f(a) by using the following relations: 

a = h(q)+qh'(q) f(a) = q[a-h(q)] + l. (13) 



We computed the f(ct) spectra with MF-DFA for all time series Z± and for 
At = 5 min (shorter time scales are too noisy, while longer ones are repre- 
sented by too short series for a reliable estimation of f(ct)) and the corre- 
sponding results are shown in Figure 10(a). Wide spectra prove that all Zi's 
are multifractal with only minor differences in widths of the spectra for dif- 
ferent eigensignals. Although positions of maxima of the spectra vary, there is 
no significant i-dependence both in the positions and in shape of the spectra. 
It is interesting that even the most collective and correlated eigensignal for 
Ai (the solid, distinguished line in Figure 10(a)) does not develop spectrum 
which could deviate from the typical one. This is even more evident if we 
compare the spectrum for Z\ with the average spectrum calculated from all 
other Zj's in Figure 10(b). Such similarity of multifractal properties of the 
eigensignals representing completely different correlation structure and differ- 
ent p.d.f.'s can suggest that the eigensignals corresponding to a few largest 
Aj's and the ones corresponding to the bulk of the eigenspectrum are in fact 
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much more similar to each other than it is usually assumed. This also bears a 
serious concern regarding the justification of treating the "noisy" eigenstates 
as completely random without any information content. 

As we already know the autocorrelations of volatility \Zj\ for i = 1 are dif- 
ferent from those for % > 1 (Figure 8), while the normalized returns have 
similar c.d.f.'s for all i. Now we would like to compare the fractal properties 
of different eigensignals in order to separate the two sources of multifractality: 
the broad distributions of returns and the correlations, and to compare the 
singularity spectra f(a) for these sources. We follow the idea of [40] and we 
study variability of generalized Hurst exponents h(q) for the actual and the 
reshuffled eigensignals. If we denote the generalized Hurst exponent for the 
randomized signal by h ran d(q), its correlation counterpart reads 

KoAq) ■■= h(q) - h iand (q). (14) 

Variability of h(q) can be expressed by the difference 

Ah := h(q min ) - h(q max ) (15) 



and, analogously, the variability of h rSuIld (q) and h corr (q). Each symbol in Fig- 
ure 11 presents positions of the eigensignals Z { in (A/i corr , A/i rand ) coordinates. 
Eigensignals corresponding to the four largest eigenvalues are labelled. The 
higher value of A/i ran d, the richer the multifractal behaviour due to the fat- 
tailed probability distributions of returns. Analogously, high values of Ah corr 
indicate strong contribution of the temporal correlations. These temporal cor- 
relations implying the observed multifractal behaviour of signals have to be 
strongly nonlinear: our earlier calculations [32] showed that even the volatility 
autocorrelations that are nonlinear in returns are not capable of producing 
this phenomenon. Furthermore, it can be easily inferred from Figure 11 that 
there is nothing characteristic in the positions of the symbols related to -^i,...,4- 
Results collected in Figure 10(b) and Figure 11 indicate that the multifrac- 
tal analysis cannot point out any essential differences between the collective 
eigenstates and the noisy ones. 



4 Conclusions 



We analysed the eigensignals corresponding to different eigenvalues of the em- 
pirical correlation matrix calculated for the 100 highly-capitalized American 
companies. From a practical point of view, these eigensignals represent tem- 
poral evolution of specific portfolios defined by the corresponding eigenvectors 
of C. We showed that despite the important differences in interpretation of 
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different eigensignals, Zj's for the collective and for the noisy eigenvalues can 
reveal different or similar statistical properties depending on a particular quan- 
tity. What differs most is risk expressed by variance of an eigensignal, which 
is very high for the most collective Z ± , is also significant for Z 2 , and is much 
smaller for the rest of the Zj's. This is closely related to the eigenvector prop- 
erties, which are different for highest and other Aj's. The risk is also related 
to the width of the returns distributions, preventing the eigensignals for small 
eigenvalues from high fluctuations and the associated portfolios from large 
losses. The second group of quantities which reveal different values for differ- 
ent Zj's are nonlinear correlations, both those in volatility and those between 
returns and volatility. On the other hand, there are properties that remain 
unaltered when going from small to large values of i. The most interesting 
one is the multifractality of all eigensignals which is surprisingly roughly the 
same for different Zj's. Curiously, this happens even if the above-mentioned 
correlations vary among the eigensignals. One of possible sources of this can 
be similar shape of tails of the probability distributions, which after normal- 
izing the eigensignal returns show the inverse cubic scaling for all %. Both the 
fat-tailed distributions of returns and the multifractal character of eigensignals 
for each eigenvalue lead to a conclusion that the noisy eigenstates might not 
be so random as they are usually regarded by their relation to RMT. Rich 
multifractal dynamics of the eigensignals corresponding to even the random 
part of the eigenvalue spectrum suggests that strong nonlinear correlations are 
present in the temporal evolution of each portfolio giving it significance which 
exceeds pure noise. This conclusion is strongly supported by the observation 
(Section 3.1) that many real correlations may be masked by noise due to too 
short signals considered in practical applications. 
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